Continuous Voice Morphing Using Separated Vocal Tract Area Functions and Glottal Source Waves
Authors
Abstract
This paper presents a flexible voice morphing method based on a linear combination of vocal tract area functions estimated from speech signals. The method focuses on preserving the continuity of phonological identity across the entire interpolated region. Its main features are 1) separation of the vocal tract resonance characteristics from those of the glottal source wave using AR-HMM analysis of speech, 2) independent morphing of the vocal tract resonance and glottal source wave characteristics, and 3) non-linear interpolation in the log vocal tract area function domain. The method employs statistical mapping in the log vocal tract area function domain for the vocal tract and in the cepstrum domain for the glottal source wave. We show that a morphing system built on the proposed method improves formant continuity and speech quality at intermediate morphing rates.
Keywords: Voice Conversion; AR-HMM Analysis; Statistical Mapping; Continuity of Phonological Identity
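As a rough illustration of what interpolating in a log vocal tract area function domain can mean, the sketch below linearly blends the logarithms of two area functions, which is non-linear (geometric) in the area domain itself. This is one plausible reading of the abstract, not the authors' implementation: the function names, the four-section area values, and the reflection-coefficient sign convention are all assumptions made for the example, and the AR-HMM analysis and statistical mapping steps are not shown.

```python
import math


def morph_log_area(area_src, area_tgt, rate):
    """Blend two area functions linearly in the log domain.

    rate = 0 returns the source, rate = 1 the target. Values outside
    [0, 1] extrapolate, and the result stays positive in every case.
    (Hypothetical helper, not taken from the paper.)
    """
    if len(area_src) != len(area_tgt):
        raise ValueError("area functions must have the same number of sections")
    if any(a <= 0 for a in list(area_src) + list(area_tgt)):
        raise ValueError("section areas must be strictly positive")
    return [
        math.exp((1.0 - rate) * math.log(a) + rate * math.log(b))
        for a, b in zip(area_src, area_tgt)
    ]


def reflection_coefficients(areas):
    """Junction reflection coefficients of a lossless tube model.

    Uses k_i = (A_{i+1} - A_i) / (A_{i+1} + A_i); the sign convention
    differs between texts, so treat this choice as an assumption.
    """
    return [(nxt - cur) / (nxt + cur) for cur, nxt in zip(areas, areas[1:])]


# Made-up 4-section area functions (cm^2), for illustration only.
src = [1.0, 2.0, 4.0, 2.0]
tgt = [4.0, 2.0, 1.0, 2.0]

mid = morph_log_area(src, tgt, 0.5)      # geometric mean per section
beyond = morph_log_area(src, tgt, 1.5)   # extrapolation past the target
mid_k = reflection_coefficients(mid)
```

Because the blend happens on logarithms, every morphed section area is a weighted geometric mean of the two inputs and can never become zero or negative, even when the morphing rate is pushed outside the interval from 0 to 1. A direct linear blend of the areas does not give that guarantee under extrapolation.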
Similar resources
Voice morphing based on interpolation of vocal tract area functions using AR-HMM analysis of speech
This paper presents a new voice morphing method which focuses on the continuity of phonological identity over all inter- and extra-polated regions. Main features of the method are 1) to separate the characteristic of vocal tract area resonances from that of vocal cord waves by using AR-HMM analysis of speech, 2) interpolation in a log vocal tract area function domain and 3) independent morphing fo...
Acoustics of Human Voice Production
Human voice production is studied as an acoustic process inside the vocal tract using the wave equation for air, and compared with large-scale databases of simultaneously acquired microphone and electroglottograph signals. The following theories are confirmed: For voiced sounds, the process starts at glottal closures. Immediately before a glottal closure, there is a continuous airflow in the vo...
Analysis of a Modern Voice Morphing Approach using Gaussian Mixture Models for Laryngectomees
This paper proposes a voice morphing system for people who have undergone laryngectomy, the surgical removal of all or part of the larynx (the voice box), performed particularly in cases of laryngeal cancer. A primitive method of achieving voice morphing is by extracting the source's vocal coefficients and then converting them into the target speaker's vocal parameters. In this ...
Glottal source - vocal tract acoustic interaction
Recent developments within our group on voice source - vocal tract acoustic interaction are reviewed. Special emphasis is laid on nonlinear superposition phenomena, i.e., how the excitation within a period is dependent on the past history of vocal tract oscillations and their residual components within the transglottal pressure. A study of breathy phonation shows that constant leakage affects th...
Glottal source and vocal-tract separation: Estimation of glottal parameters, voice transformation and synthesis using a glottal model
This study addresses the problem of inverting a voice production model to retrieve, for a given recording, a representation of the sound source which is generated at the glottis level, the glottal source, and a representation of the resonances and anti-resonances of the vocal-tract. This separation gives the possibility to manipulate independently the elements composing the voice. There are man...